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Abstract. Can we do arithmetic in a completely different way, with a 
radically different data structure? Could this approach provide practi- 
cal benefits, like operations on giant numbers while having an average 
performance similar to traditional bitstring representations? 
While answering these questions positively, our tree based representation 
described in this paper comes with a few extra benefits: it compresses 
giant numbers such that, for instance, the largest known prime number as 
well as its related perfect number are represented as trees of small sizes. 
The same also applies to Fermat numbers and important computations 
like exponentiation of two become constant time operations. 
At the same time, succinct representations of sparse sets, multisets and 
sequences become possible through bijections to our tree-represented nat- 
ural numbers. 

1 Introduction 

If extraterrestrials were to do arithmetic computations, would one assume that 
their numbering system should look the same? At a first thought, one might 
be inclined to think so, maybe with the exception of the actual symbols used 
to denote digits or the base of the system likely to match numbers of fingers 
or some other anthropocentric criteria. After some thinking about the endless 
diversity of the universe (or unconventional models of Peano's axioms), one might 
also consider the possibility that the departure from our well-known number 
representations could be more significant. 

With more terrestrial uses in mind, one would simply ask: is it possible to do 
arithmetic differently, possibly including giant numbers and a radically different 
underlying data structure, while maintaining the same fundamentals - addition, 
multiplication, exponentiation, primality, Peano's axioms behaving in the same 
way? Moreover, can such exotic arithmetic be as efficient as binary arithmetic, 
while possibly supporting massively parallel execution? 

We have shown in [T] that type classes and polymorphism can be used to 
share fundamental operations between natural numbers, finite sequences, sets 
and multisets. As a consequence of this line of research, we have discovered 
that it is possible to transport the recursion equations describing binary number 
arithmetics on natural numbers to various tree types. 



However, the representation proposed in this paper is different. It is arguably 
simpler, more flexible and likely to support parallel execution of operations. It 
also allows to easily compute average and worse case complexity bounds. We 
will describe it in full detail in the next sections, but for the curious reader, it is 
essentially a recursively self-similar run-length encoding of bijective base-2 digits. 

The paper is organized as follows. Section [2] describes our type class used to 
share generic properties of natural numbers. Section [3] describes binary arith- 
metic operations that are specialized in section [4] to our compressed tree rep- 
resentation. Section [5] describes efficient tree-representations of some important 
number-theoretical entities like Mersenne, Fermat and perfect numbers. Section 
[6] shows that sparse sets, multisets and lists have succinct tree representation. 
Section [7] describes generic isomorphisms between data types, centered around 
transformations of instances of our type class to their corresponding sets, multi- 
sets and lists. Section [8] shows interesting complexity reductions in other compu- 
tations and section [9] compares the performance of our tree-representation with 
conventional ones. Section [10] discusses related work. Section QT] concludes the 
paper and discusses future work. 

To provide a concise view of our compressed tree data type and its oper- 
ations, we will use the strongly typed functional language Haskell as a pre- 
cise means to provide an executable specification. We have adopted a literate 
programming style, i.e. the code contained in the paper forms a self-contained 
Haskell module (tested with ghc 7.4.1), also available as a separate file at http: 
|//logic . cse .Tint . ed u/tarau/research/2013/gian t ,hs| . Also, a Scala pack- 
age implementing the same tree-based computation is available from http : 
//code .google . com/p/giant-numbers/ We hope that this will encourage the 
reader to experiment interactively and validate the technical correctness of our 
claims. 

We mention, for the benefit of the reader unfamiliar with Haskell, that a 
notation like f x y stands for f(x,y), [t] represents sequences of type t and a 
type declaration like f : : s -> t -> u stands for a function / : sxt — > u (mod- 
ulo Haskell's "currying" operation, given the isomorphism between the function 
spaces s x t — > u and s — > t — > u). Our Haskell functions are always represented 
as sets of recursive equations guided by pattern matching, conditional to con- 
straints (simple arithmetic relations following I and before the = symbol). Locally 
scoped helper functions are defined in Haskell after the where keyword, using 
the same equational style. The composition of functions f and g is denoted f . 
g. It is also customary in Haskell, when defining functions in an equational style 
(using =) to write f = g instead of / x = g x ( "point- free" notation). The use 
of Haskell's "call-by-need" evaluation allows us to work with infinite sequences, 
like the [0 . . ] infinite list notation, corresponding to the set N itself. 

Our literate Haskell program is organized as the module Giant depending on 
a few packages, as follows: 

{-# LANGUAGE NoMonomorphismRestriction #-} 
— needed to define toplevel isomorphisms 
module Giant where 



— cabal install data-ordlist is required before importing this 

import Data. List . Ordered 

import Data. List hiding (unionBy) 

import System. CPUTime 

2 Sharing axiomatizations with type classes 

Haskell's type classes [5] are a good approximation of axiom systems as they 
describe properties and operations generically i.e. in terms of their action on 
objects of a parametric type. Haskell's type instances approximate interpreta- 
tions 3] of such axiomatizations by providing implementations of the primitive 
operations, with the added benefit of refining and possibly overriding derived 
operations with more efficient equivalents. 

We start by defining a type class that abstracts away properties of binary 
representations of natural numbers. 

The class N assumes only a theory of structural equality (as implemented by 
the class Eq in Haskell). It implements a representation- independent abstraction 
of natural numbers, allowing us to compare our tree representation with "ordi- 
nary" natural numbers represented as non-negative arbitrary large Integers in 
Haskell, as well as with a binary representation using bijective base-2 0]. 

class Eq n => N n where 

An instance of this class is required to implement the following 6 primitive 
operations: 

e : : n 

o , o ' , i , i ' : : n— >n 
o_ : : n— >Bool 

The constant function e can be seen as representing the empty sequence of binary 
digits. With the usual representation of natural numbers, e will be interpreted 
as 0. The constructors o and i can be seen as applying a function that adds a 
or 1 digit to a binary string on {0,1}*. The deconstructors o' and i' undo 
these operations by removing the corresponding digit. The recognizer o_ detects 
that the constructor o is the last one applied, i.e. that the "string ends with the 
symbol. It will be interpreted on N as a recognizer of odd numbers. 

This type class also endows its instances with generic implementations of the 
following derived operations: 



e_ 


, i_ : : n— >Bool 




e_ 


x = x = e 




i_ 


x = not (e_ x 


o_ x) 



with structural equality used implicitly in the definition of the recognizer pred- 
icate for empty sequences e_ and with the assumption that the domain is ex- 
hausted by the three recognizers in the definition of the recognizer i_ of sequences 
ending with 1, representing even positive numbers in bijective base 2. 

Successor s and predecessor s ' functions are implemented in terms of these 
operations as follows: 



s,s' : : n— >n 



s x | e_ x = o x 

s x | o_ x = i (o' x) 

s x | i_ x = o (s (i ' x) ) 

s ' x I x = o e = e 

s ' x I i_ x = o (i ' x) 

s' x | o_ x = i (s' (o' x)) 

By looking at the code, one might notice that our generic definitions of operations 
mimic recognizers, constructors and destructors for bijective base-2 numbers, i.e. 
sequences in the language {0, 1}*, similar to binary numbers, except that is 
represented as the empty sequence and left-delimiting by 1 is omitted. 

Proposition 1 Assuming average constant time for recognizers, constructors 
and destructors e_,o_o,i_,i,o' successor and predecessor s ands' are con- 
stant time, on the average. 

Proof. Clearly the first two rules are constant time for both s and s ' as they do 
not make recursive calls. To show that the third rule applies recursion a constant 
number of times on the average, we observe that the recursion steps are exactly 
given by the number of 0s or Is that a (bijective base-2 number) ends with. As 
only half of them end with a and another half of those end with another etc. 
one can see that the average number of 0s is bounded by \ + \ + . . . = 1. The 
same reasoning applies to the average number of Is a number can end with. 

The infinite stream of generic natural numbers is generated by iterating over the 
successor operation s: 

allFrom : : n— > [n] 
allFrom x = iterate s x 



3 Efficient arithmetic operations, generically 



We will first show that all fundamental arithmetic operations can be described 
in this abstract, representation-independent framework. This will make possible 
creating instances that, on top of symbolic tree representations, provide im- 
plementations of these operations with asymptotic efficiency comparable to the 
usual bitstring operations. 

We start with addition (add) and subtraction (sub): 



add 

add x y 
add x y 
add x y 
add x y 
add x y 



e_ x 

e_ y 

o_ x 
o_ x 



i_ x kk o. 



y = i (add (o' x) (o' y) ) 

y = o (s (add (o> x) (i> y))) 

y = o (s (add (i' x) (o' y))) 



add x y | i_ x kk i_ y = i (s (add (i' x) (i' y))) 
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(o (sub (i' y) 


(i> x))) 



It is easy to see that addition and subtraction are implemented generically, with 
asymptotic complexity proportional to the size of the operands. Comparison 
provides the expected total order of N on our type class: 



cmp : : n 
cmp x y 
cmp x _ 
cmp _ y 
cmp x y 
cmp x y 
cmp x y 

down EQ = LT 

down r = r 
cmp x y | i_ x 

up EQ = GT 

up r = r 



n— > Ordering 
e_ x kk e_ y 
e_ x = LT 
e_ y = GT 
o_ x kk o_ y 



= EQ 



cmp (o' x) (o' y) 
y = cmp (i' x) (i' y) 
y = down (cmp (o' x) (i' y)) where 



o_ y = up (cmp (i' x) (o' y)) where 



And based on it one can define the minimum min2 and maximum max2 functions 
as follows: 

min2,max2 : : n— >n— >n 

min2 x y = if LT=cmp x y then x else y 
max2 x y = if LT=cmp x y then y else x 

Next, we define multiplication: 

mul : : n— >n— >n 
mul x | e_ x = e 
mul y | e_ y = e 

mul x y = s (m (s' x) (s' y)) where 
m x y | e_ x = y 
m x y | o_ x = o (m (o' x) y) 



as well as double of a number db and half of an even number hf , having both 
simple expressions: 



db.hf : : n— >n 
db = s' . o 
. i' 



Power is defined as follows: 



pow : : n— >n— >n 

pow _ y | e_ y = o e 




pow x y | o_ y = mul x (pow (mul x x) (o' y) ) 
pow x y | i_ y = mul (mul x x) (pow (mul x x) 



together with more efficient special instances, exponent of 2 (exp2) and multi- 
plication by a power of 2 (lef tshift): 

exp2 : : n— >n 

exp2 x | e_ x = o e 

exp2 x = db (exp2 (s' x) ) 

leftshift : : n— >n— >n 
leftshift x y = mul (exp2 x) y 

Finally, division and reminder on N is a bit trickier but can be expressed gener- 
ically as well: 

div_and_rem :: n— >n— Kn,n) 

div_and_rem x y | LT = cmp x y = (e,x) 
div_and_rem x y | not (e_ y) = (q,r) where 

(qt.rm) = divstep x y 

(z,r) = div_and_rem rm y 

q = add (exp2 qt) z 

divstep : : N n => n— >n— Kn,n) 
divstep n m = (q, sub n p) where 

q = try_to_double n m e 

p = mul (exp2 q) m 

try_to_double x y k = 
if (LT=cmp x y) 
then s ' k 

else try_to_double x (db y) (s k) 
divide, reminder :: n— >n— >n 
divide n m = fst (div_and_rem n m) 



And for the reader curious by now about how this maps to "arithmetic as 
usual", here is an instance built around the (arbitrary length) Integer type, 
also usable as a witness on the time/space complexity of our operations. 

instance N Integer where 

e = 

o_ x = odd x 
o x = 2*x+l 

o' x | odd x && x > = (x-1) 'div' 2 




i x = 2*x+2 



i' x I even x && x > = (x-2) 'div' 2 

An instance mapping our abstract operations to actual constructors, follows in 
the form of the datatype B 

data B = B | B | I B deriving (Show, Read, Eq) 

instance N B where 
e = B 

= 

1 = I 

o' (0 x) = x 
i' (I x) = x 

o_ (0 _) = True 
o = False 

One can try out various operations on these instances: 

*Giant> mul 10 5 
50 

*Giant> exp2 5 
32 

*Giant> add (0 B) (I (0 B)) 
(I B) 

4 Computing with our compressed tree representations 

We will now show how our shared axiomatization framework can be implemented 
as a new, somewhat unusual instance, that brings the ability to do arithmetic 
computations with trees. 

First, we define the data type for our tree represented natural numbers: 

data T = T | V T [T] W T [T] deriving (Eq, Show, Read) 

The intuition behind this "union type" is the following: 

— The type T corresponds to an empty sequence 

— the type V x xs counts the number x of o applications followed by an alter- 
nation of similar counts of i and o applications 

— the type W x xs counts the number x of i applications followed by an alter- 
nation of similar counts of o and i applications 

— the same principle is applied recursively for the counters, until the empty 
sequence is reached 

One can see this process as run-length compressed bijective base-2 numbers, rep- 
resented as trees with either empty leaves or at least one branch, after applying 
the encoding recursively. First we define the 6 primitive operations: 



instance N T where 
e = T 



o T = V T [] 

o (V x xs) = V (s x) xs 
o (W x xs) = V T (x:xs) 



i T = W T [] 

i (V x xs) = W T (x:xs) 
i (W x xs) = W (s x) xs 



o' (V T []) = T 

o' (V T (x:xs)) = W x xs 

o' (V x xs) = V (s' x) xs 



i' CW T []) = T 

i' (W T (x:xs)) = V x xs 

i' (W x xs) = W (s' x) xs 



o_ (V _ _ ) = True 
o_ _ = False 



Next, we override two operations involving exponents of 2 as follows 

exp2 = exp2' where 
exp2' T = V T [] 
exp2' x = s (V (s' x) []) 

leftshift = leftshift' where 
leftshift' T = T 

leftshift' n y | o_ y = s (vmul n (s' y) ) 



The leftshift' operation uses an efficient implementation, specialized for the 
type T, of the repeated application (n times) of constructor o, over the second 
argument of the function vmul: 



vmul T y = y 
vmul n T = V (s' 
vmul n (V y ys) 
vmul n (W y ys) 



n) [] 

= V (add (s' n) y) 
= V (s' n) (y:ys) 



Note that such overridings take advantage of the specific encoding, as a re- 
sult of simple number theoretic observations. For instance, the operation exp2 ' 
works in time proportional to s and s ' , that can be shown to be constant on the 
average. The more complex leftshift' operation observes that repeated appli- 
cation of the o operation, when adjusted based on being even or odd, provides 
an efficient implementation of multiplication with an exponent of 2. 

It is convenient at this point, as we target a diversity of interpretations ma- 
terialized as Haskell instances, to provide a polymorphic converter between two 



different instances of the type class N as well as their associated lists, imple- 
mented by structural recursion over the representation to convert. The function 
view allows importing a wrapped object of a different instance of N, generically. 

view :: (N a,N b)=>a— >b 

view x | e_ x = e 

view x | o_ x = o (view (o' x)) 

view x | i_ x = i (view (i' x)) 

We can specialize view to provide conversions to our three data types, each 
denoted with the corresponding lower case letter, tt t, b and n for the usual 
natural numbers. 

t : : (N n) => n — > T 
t = view 

b : : (N n) => n — > B 
b = view 

n : : (N n) => n — » Integer 

One can try them out as follows: 

*Giant> t 42 

W (V T []) [T.T.T] 

*Giant> b it 

I (I (0 (I (0 B)))) 

*Giant> n it 

42 

While space constraints forbid us from providing the correctness proofs of 
operations like exp2' and leftshift', we are able to illustrate their expected 
usage as follows: 

*Giant> t 5 
V T [T] 

*Giant> exp2 it 

W T [V (V T []) []] 

*Giant> n it 

32 

*Giant> t 10 

W (V T []) [T] 

*Giant> leftshift it (t 1) 

W T [W T [V T []]] 

*Giant> n it 

1024 



5 Efficient representation of some important 
number-theoretical entities 

Let's first observe that Fermat, Mersenne and perfect numbers have all compact 
expressions with our tree representation of type T. 

fermat n = s (exp2 (exp2 n) ) 
mersenne p = s' (exp2 p) 

perfect p = s (V q [q] ) where q = s' (s' p) 

And one can also observe that this contrasts with both the Integer repre- 
sentation and the bijective base-2 numbers B: 

*Giant> mersenne (b 127) 

(0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 

... a few lines of 0s and Is 

))))))))))))))))))))))))))))))))))))))))) 

*Giant> mersenne (n 127) 
170141183460469231731687303715884105727 
*Giant> mersenne (t 127) 

V (W (V T [T] ) [] ) [] 

The largest known prime number, found by the GIMP distributed computing 
project [5] is the 45-th Mersenne prime = 2 43112609 — 1. It is defined in Haskell 
as follows: 

— its exponent 

prime45 = 43112609 : : Integer 

— the actual Mersenne prime 
mersenne45 = s' (exp2 (t p)) where 

p = prime45 : : Integer 

While it has a bitsize of 43112609, we have observed that its compressed tree 
representation using our type T is rather small: 

*Giant> mersenne45 

V (W T [V (V T []) [],T,T,T,W T [] ,V T [] ,T,W T [] ,W T [] ,T,V T [],T,T]) [] 

One the other hand, displaying it with a decimal or binary representation would 
take millions of digits. 

And by folding replicated subtrees to obtain an equivalent DAG represen- 
tation, one can save even more memory. Figure [l] shows this representation, 
involving only 6 nodes. 

It is interesting to note that similar compact representations can also be 
derived for perfect numbers. For instance, the largest known perfect number, 
derived from the largest known Mersenne prime as 2 43112609_1 (2 43112609 — 1), is: 

perfect45 = perfect (t prime45) 




Fig. 2. Largest known perfect number 



Fig. [2] shows its DAG representation involving only 7 nodes. Similarly, the 
largest Fermat number that has been factored so far, Fll=2 2 + 1 is compactly 
represented as 

*Giant> fermat (t 11) 

V T [T,V T [W T [V T []]]] 

By contrast, its (bijective base-2) binary representation consists of 2048 digits. 

6 Representing sparse sets, multisets and lists 

We will now describe bijective mappings between collection types as well as 
a Godel numbering scheme putting them in bijection with natural numbers. 
Interestingly, natural number encodings for sparse instances of these collections 
will have space-efficient representations as natural numbers of type T, in contrast 
with bitstring or conventional Integer-based representations. 

The type class Collections will convert between natural numbers and lists, 
by using the bijection f(x,y) = 2 x (2y + 1), implemented by the function c and 
its first and second projections c' and c' ' , inverting it. 



class (N 


n) => Collections n where 




c : : n- 


-^n^n 




c' ,c" 


: : n— >n 




c x y = 


= mul (exp2 x) (0 y) 




C ' X | 


not (e_ x) = if o_ x then e else s (c' 


(hf x)) 


c" X i 


not (e_ x) = if o_ x then 0' x else c' 


' (hf x) 



The bijection between natural numbers and lists of natural numbers, to_list 
and its inverse from_list apply repeatedly c and respectively c' and c". 



to_list : : 


n^ [n] 




to_list x 


e_ x = [] 




to_list x 


= (c* x) : (to_list (c' 


' x)) 


f rom_list : 


: [n] ->n 




f rom_list 


[] = e 




f rom_list 


(x:xs) = c x (from_list 


xs) 



Incremental sums are used to transform arbitrary lists to multisets and sets, 
inverted by pairwise differences. 



Iist2mset, mset21ist, list2set, set21ist :: [n] — >• [n] 

list2mset ns = tail (scanl add e ns) 
mset21ist ms = zipWith sub ms (e:ms) 

list2set = (map s') . Iist2mset . (map s) 
set21ist = (map s') . mset21ist . (map s) 



By composing with natural number-to-list bijections, we obtain bijections to 
multisets and sets. 

to_mset, to_set :: n— > [n] 
from_mset, from_set :: [n] — >n 

to_mset = list2mset . to_list 
from_mset = from_list . mset21ist 

to_set = list2set . to_list 
from_set = from_list . set21ist 

We will add the usual instances to the type class Collections. A simple number- 
theoretic observation connecting 2™ and n applications of the constructor i, 
implemented by the function vmul, allows a shortcut that speeds up the bijection 
from lists to natural numbers, by overriding the functions c , c ' , c ' ' in instance 
T. 

instance Collections B 
instance Collections Integer 
instance Collections T where 
c = cons where 

cons n y = s (vmul n (s' (o y) ) ) 

c ' = hd where 
hd z | o_ z = T 
hd z = s x where 

V x _ = s' z 

c" = tl where 

tl z | o_ z = o' z 
tl z = f is where 

V _ xs = s' z 



f [] = T 




As the following example shows, trees of type T offer a significantly more compact 
representation of sparse sets. 



*Giant> from_set (map t [1,100,123,234]) 

W (V T []) [V T [T,W T [],T],T,V T [V T [] ,T] ,T,V T [¥ T [],T,T]] 
*Giant> from_set (map n [1,100,123,234]) 

27606985387162255149739023449108112443629669818608757680508075841159170 
*Giant> from_set (map b [1,100,123,234]) 

I (I (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 (0 ... 

... a few lines . . . 

..))))))))))) 

Note that a similar compression occurs for sets of natural numbers with only a 
few elements missing, as they have the same representation size with type T as 
the dual of their sparse counterpart. 



Giant> from_set ( [1 ,3, 5] ++ [6 . . 220] ) 

3369993333393829974333376885877453834204643052817571560137951281130 
*Giant> t it 

W (V T []) [T.T.T.W (W T [] ) [T,T,T,T] ] 
*Giant> dual it 

V (V T []) [T,T,T,W (W T [] ) [T,T,T,T] ] 
*Giant> to_set it 

[T,V T [] ,W T [T],W T [T,W T [],T,T]] 

*Giant> map n it 

[0,1,4,220] 

As an application, we can define bitwise operations on our natural numbers 
by borrowing the corresponding ordered set operations, provided in Haskell by 
the package Data. List . Ordered. 

First we define the type class BitwiseOperations and the higher order func- 
tion l_op transporting a binary operation from ordered sets to natural numbers. 

class Collections n => BitwiseOperations n where 
l_op :: ([n]->[n]->[n])- 



Next we define the bitwise and, or and xor operations: 

l_and , l_or , l_xor , l_dif : : n— S-n— >n 
l_and = l_op (isectBy cmp) 
l_or = l_op (unionBy cmp) 
l_xor = l_op (xunionBy cmp) 
l_dif = l_op (minusBy cmp) 

More complex operations like the ternary if-the-else can be defined as a 
combination of binary operations: 

l_ite : : n— >n— >n— >n 

l_ite x y z = from_set (ite (to_set x) (to_set y) (to_set z)) where 
ite d a b = xunionBy cmp e b where 
c = xunionBy cmp a b 
e = isectBy cmp c d 

Finally, bitwise negation (requiring additional parameter 1 defining the bitlcngth 
of the operand) can be defined using set complement with respect to 2 l — 1, 
corresponding to the set of all elements up to 1. 

l_not : : Integer— 5>n— >n 

l_not 1 x |xl<l = from_set (minusBy cmp ms xs) where 
xs = to_set x 
xl = genericLength xs 



We will next generalize the iso-functor mechanism implemented generically by 
l_op that transports operations back an forth between data types as a special 
data type, consisting of two higher order functions inverse to each other. 



7 Isomorphisms between data types, generically 



Along the lines of [B] we can define isomorphisms between data types as follows: 
data Iso a b = Iso (a— ¥b) (b— >&) 

from (Iso f _) = f 
to (Iso _ f) = f ' 



Morphing between data types as well as "lending" operations from one to another 
is provided by the combinators as, landl and land2: 



as that this x = 


to that 


(from this x) 


lendl opl (Iso f 


f ') x = 


f (opl (f x)) 


lend2 op2 (Iso f 


f) x y 


= f ' (op2 (f x) (f y)) 



Assuming that the Haskell option NoMonomorphismRestriction is set on, we 
can now define generically "virtual types" centered around type class N: 



nat = Iso id id 
list = Iso from_list to_list 
mset = Iso from_mset to_mset 
set = Iso from_set to_set 

This results in a small "embedded language" that morphs between various in- 
stances of type class N and their corresponding list, multiset and set types, as 
follows: 

*Giant> as set nat 1234 

[1,4,6,7,10] 

*Giant> map t it 

[V T [],W T [T],W (V T []) [],V (W T [] ) [] ,W (V T [] ) [T] ] 

*Giant> as nat set it 

W (V T []) [V T [],T,T,V T [] ,V T [] ] 

*Giant> n it 

1234 

*Giant> lendl s set [0,2,3] 
[1,2,3] 

*Giant> lend2 add set [0,2,3] [4,5] 
[0,2,3,4,5] 

8 Complexity reduction in other computations 

A number of other, somewhat more common computations also benefit from 
our data representations. The type class SpecialComputations groups them 
together and provides their bitstring inspired generic implementations. 

The function dual flips o and i operations for a natural number seen as 
written in bijective base 2. 



class Collections n => SpecialComputations n where 
dual : : n— >il 
dual x | e_ x = e 
dual x I o_ x = i (dual (o' x)) 



The function bitsize computes the number of applications of the o and i op- 
erations: 

bitsize : : n— >n 
bitsize x | e_ x = e 

bitsize x I o_ x = s (bitsize (o' x) ) 



The function repsize computes the representation size, which defaults to the 
bit-size in bijective base 2: 

— representation size - defaults to bitsize 
repsize : : n— >n 



The functions decons and cons provide bijections between N— {0} and NxN and 
can be used as an alternative mechanism for building bijections between lists, 
multisets and sets of natural numbers and natural numbers. They are based on 
separating o and i applications that build up a natural number represented in 
bijective base 2. 

decons : :n— Kn,n) 
cons : : (n,n)— m 

decons z | o_ z = (x,y) where 
xO = s' (ocount z) 
y = otrim z 

x = if e_ y then (s' .o) xO else xO 
decons z | i_ z = (x,y) where 
xO = s' (icount z) 
y = itrim z 

x = if e_ y then (s' .i) xO else xO 
cons (x,y) | e_ x && e_ y = s e 

cons (x,y) | o_ x && e_ y = itimes (s (i' (s x))) e 
cons (x,y) | i_ x && e_ y = otimes (s (o' (s x))) e 
cons (x,y) | o_ y = itimes (s x) 



Implementing decons and cons requires counting the number of applications of 
o and i provided by ocount and icount, as well trimming the applications of o 
and i, performed by otrim and itrim. 

ocount , icount , otrim, itrim :: n— >n 



ocount x | o_ x = s (ocount (o' x) ) 
ocount _ = e 



icount x | i_ x = s (icount (i' x) ) 
icount = e 

otrim x | o_ x = otrim (o' x) 
otrim x = x 

itrim x | i_ x = itrim (i' x) 
itrim x = x 

otimes , itimes :: n— >n— >n 

otimes x y | e_ x = y 

otimes x y = otimes (s' x) (o y) 

itimes x y | e_ x = y 

itimes x y = itimes (s' x) (i y) 

An alternative bijection between natural numbers and lists of natural num- 
bers, to_list' and its inverse from_list' is obtained by applying repeatedly 
cons and respectively decons. 

to_list ' : : n— > [n] 
to_list ' x | e_ x = [] 

to_list' x = hd : (to_list' tl) where (hd,tl)=decons x 



from_list' :: [n] — >il 
f rom_list ' [] = e 




One can observe the significant reduction of asymptotic complexity with re- 
spect to the default operations provided by the type class SpecialComputations 
when overriding with tbitsize and tdual in instance T. 

instance SpecialComputations Integer 
instance SpecialComputations B 
instance SpecialComputations T where 
bitsize = tbitsize where 
tbitsize T = T 

tbitsize (V x xs) = s (foldr addl x xs) 
tbitsize (W x xs) = s (foldr addl x xs) 

addl x y = s (add x y) 

dual = tdual where 
tdual T = T 

tdual (V x xs) = W x xs 
tdual (W x xs) = V x xs 



repsize = tsize where 
tsize T = T 



tsize (V x xs) = s (foldr add T (map tsize (x:xs))) 



The replacement with special purpose code for the cons / decons functions is 
even more significant: 

decons (V x [] ) = ((s'.o) x,T) 

decons (V x (y:ys)) = (x,W y ys) 

decons (W x [] ) = ((s'.i) x,T) 

decons (W x (y:ys)) = (x,V y ys) 



cons (T,T) = V T [] 

cons (x,T) o_ x = W (i> (s x)) [] 

cons (x,T) | i_ x = V (o' (s x) ) [] 

cons (x,V y ys) = W x (y:ys) 

cons (x,W y ys) = V x (y:ys) 



One can also see that their complexity is now proportional to s and s ' given 
that the V and W operations perform in constant time the work of otimes and 
itimes. The following example illustrates their work: 

*Giant> map to_list> [0..20] 

[[] , [0] , [1] , [2] , [0,0] , [0,1] , [3] , [4] , [0,2] , [0,0,0] , [1,0] , [1,1] , 

[0,0,1] , [0,3] , [5] , [6] , [0,4] , [0,0,2] , [1,2] , [1,0,0] , [0,0,0,0]] 
*Giant> map from_list' it 

[0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20] 

Note the shorter lists created close to powers of 2, coming from the longer se- 
quences of consecutive o and im operations in that region. 



9 A performance comparison 

Our performance measurements (run on a Mac Air with 8GB of memory and an 
Intel i7 processor) serve two objectives: 

1. to show that, on the average, our tree based representations perform on a 
blend of arithmetic computations within a small constant factor compared 
with conventional bitstring-based computations 

2. to show that on interesting special computations they outperform the con- 
ventional ones due to the much lower asymptotic complexity of such opera- 
tions on data type T. 

Objective 1 is served by the Ackerman function that exercises the successor 
and predecessor functions quite heavily, the prime generation and the Lucas- 
Lehmer primality test for Mcrsenne numbers that exercise a blend of arithmetic 
operations. 

Objective 2 is served by the other benchmarks that take advantage of the 
overriding by instance T of operations like exp2 and bitsize, as well as the 
compressed representation of large numbers like the 45-th Mersenne prime and 
perfect numbers. In some cases the conventional representations are unable to 
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Fig. 3. Time (in ms.) on a few benchmarks 



run these benchmarks within existing computer memory and CPU-power limi- 
tations (marked with ? in the comparison table of Fig. [3]). In other cases, like 
in the sparse set encoding benchmark, data type T performs significantly faster 
than binary representations. 

Together they indicate that our tree-based representations are likely to be 
competitive with existing bitstring-based packages on typical computations and 
significantly outperform them on some number-theoretically interesting compu- 
tations. While the code of the benchmarks is omitted due to space constraints, 
it is part of the companion Haskell file at http://logic.cse.unt.edu/tarau/ 
Research/2013/ giant . hs . 



10 Related work 

We will briefly describe here some related work that has inspired and facilitated 
this line of research and will help to put our past contributions and planned 
developments in context. 

Several notations for very large numbers have been invented in the past. 
Examples include Knuth's arrow-up notation [7] covering operations like the 
tetration (a notation for towers of exponents). In contrast to our tree-based nat- 
ural numbers, such notations are not closed under addition and multiplication, 
and consequently they cannot be used as a replacement for ordinary binary or 
decimal numbers. 

The first instance of a hereditary number system, at our best knowledge, 
occurs in the proof of Goodstein's theorem [8] , where replacement of finite num- 
bers on a tree's branches by the ordinal u> allows him to prove that a "hailstone 
sequence" visiting arbitrarily large numbers eventually turns around and termi- 
nates. 

Numeration systems on regular languages have been studied recently, e.g. in 
[9] and specific instances of them are also known as bijective base-k numbers [3]. 
Arithmetic packages similar to our bijective base-2 view of arithmetic operations 
are part of libraries of proof assistants like Coq [10] and the corresponding regular 



languages have been used as a basis of decidable arithmetic systems like (W)S1S 
[IT] and (W)S2S [T2\. 

Arithmetic computations based on recursive data types like the free magma 
of binary trees (isomorphic to the context-free language of balanced parentheses) 
are described in [13] and Q3], where they are seen as Godel's System T types, 
as well as combinator application trees. In [I] a type class mechanism is used to 
express computations on hereditarily finite sets and hereditarily finite functions. 

An emulation of Peano and conventional binary arithmetic operations in 
Prolog, is described in [15] . Their approach is similar as far as a symbolic rep- 
resentation is used. The key difference with our work is that our operations 
work on tree structures, and as such, they are not based on previously known 
algorithms. 

Arithmetic computations with types expressed as C++ templates are described 
in [16] and in online articles by Oleg Kiselyov using Haskell's type inference 
mechanism. However, the algorithm described there is basically the same as 
[15] . focusing on Peano and binary arithmetics. 

Efficient representation of sparse sets are usually based on a dedicated data 
structure |17j and they cannot be at the same time used for arithmetic compu- 
tations as it is the case with our tree-based encoding. 

In [18] integer decision diagrams are introduced providing a compressed rep- 
resentation for sparse integers, sets and various other data types. However like- 
wise P3] and [Tj, and by contrast to those proposed in this paper, they do not 
compress dense sets or numbers. 

Ranking functions (bijections between data types and natural numbers) can 
be traced back to Godel numberings [THj associated to formulae. Together with 
their inverse unranking functions they are also used in combinatorial generation 
algorithms |20)21| . 

As a fresh look at the topic, we mention recent work in the context of func- 
tional programming on connecting heterogeneous data types through bijective 
mappings and natural number encodings [22123124] . 

11 Conclusion and future work 

We have seen that the average performance of arithmetic computations with 
trees of type T is comparable, up to small constant factors, to computations 
performed with the binary data type B and it outperforms them by an arbitrarily 
large margin on the interesting special cases favoring the tree representations. 

Still, does that mean that such binary trees can be used as a basis for a 
practical arbitrary integers package? 

Native arbitrary length integer libraries like GMP or Biglnteger take advan- 
tage of fast arithmetic on 64 bit words. 

To match their performance, we plan to switch between bitstring represen- 
tations for numbers fitting in a machine word and a a tree representation for 
numbers not fitting in a machine word. 



We have shown that some interesting number-theoretical entities like Mersenne, 
Fermat and perfect numbers have significantly more compact representations 
with our tree-based numbers. One may observe their common feature: they are 
all represented in terms of exponents of 2, successor /predecessor and specialized 
multiplication operations. 

The fundamental theoretical challenge raised at this point is the following: 
can other number-theoretically interesting operations, with possible applications 
to cryptography be also expressed succinctly in terms of our tree-based data type? 
Is it possible to reduce the complexity of some other important operations, besides 
those found so far? 

The methodology to be used relies on two key components, that have been 
proven successful so far, in discovering succinct representations for Mersenne, 
Fermat and perfect numbers, as well as low complexity algorithms for operations 
like bitsize and exp2: 

— partial evaluation of functional programs with respect to known data types 
and operations on them, as well as the use of other program transformations 

— salient number-theoretical observations, provable by induction, that relate 
operations on our tree data types to known identities and number-theoretical 
algorithms 
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Appendix 

This appendix contains some additional code, used for testing and benchmarking 
our functions, grouped in the type class Benchmarks. First we define a prime 
number generator working with all our instances. 

class SpecialComputations n => Benchmarks n where 
primes : : [n] 

primes = s (s e) : filter is_prime (odds_from (s (s (s e)))) where 
odds_from x = x : odds_from (s (s x)) 



is_prime p = [p]=to_primes p 

to_primes n = to_f actors n p ps where 
(p:ps) = primes 

to_factors n p ps cmp (mul p p) n = GT = [n] 
to_factors n p ps | e_ r = p : to_factors q p ps where 

(q,r) = div_and_rem n p 
to_factors n p (hd:tl) = to_factors n hd tl 

Next we define the Lucas-Lehmer fast primality test for Mersenne numbers: 

lucas_lehmer : : n — > Bool 
lucas_lehmer p = e_ y where 

p_2 = s' (s' p) 

four = i (o e) 

m = exp2 p 

m' = s 1 m 

y = f p_2 four 

f k n | e_ k = n 
f k n = r where 

x = f (s ' k) n 

y = s' (s' (mul x x) ) 

— r = reminder y m' 

r = fastmod y m 

— fast computation of k mod 2~p-l 
fastmod k m | k = s' m = e 
fastmod k m | LT = cmp k m = k 
fastmod k m = fastmod (add q r) m where 
(q,r) = div_and_rem k m 

— exponents leading to Mersenne primes 
mersenne_prime_exps : : [n] 

mersenne_prime_exps = filter lucas_lehmer primes 

— actual Mersenne primes 
mersenne_primes : : [n] 

mersenne_primes = map f mersenne_prime_exps where 
f p = s' (exp2 p) 

The Ackerman function is a good benchmark for successor and predecessor op- 
erations: 

ack : : n— >n— >n 

ack x n | e_ x = s n 

ack ml x | e_ x = ack (s' ml) (s e) 

ack ml nl = ack (s' ml) (ack ml (s' nl)) 

Next we define a variant of the 3x+l problem / Collatz conjecture / Syracuse 
function (see http : //en . wikipedia.org/wiki/Collatz_conjecture) that, some- 



what surprisingly, can be expressed as a mix of arithmetic operations and re- 
flected list operations, to test the relative performance of some of our instances. 
It is easy to show that the Collatz conjecture is true iff the function nsyr, im- 
plementing the n-th iterate of the Syracuse function, always terminates: 



Syracuse 


: : n— >n 




— n— > c ' 


' (3n+2) 




Syracuse 


n = c ' ' ( add n ( i 


n)) 


nsyr : : n— > [n] 




nsyr n 


e_ n = [e] 




nsyr n = 


n : nsyr (Syracuse 


n) 



Finally we close our type class with the usual instance declarations: 



instance 


Benchmarks 


Integer 


instance 


Benchmarks 


B 


instance 


Benchmarks 


T 



The following example illustrates the first 8 sequences of the Syracuse function: 



*Giant> map nsyr [0..7] 

[[0] , [1,2,0] , [2,0] , [3,5,8,6,2,0] , [4,3,5,8,6,2,0] , 
[5,8,6,2,0] , [6,2,0] , [7,11,17,26,2,0]] 



Our generic benchmark function measures the CPU time for running a no 
argument toplevel function f received as a parameter. 



benchmark mes f = 


do 


x^— getCPUTime 




print f 




y^— getCPUTime 




let time=(y-x) 


'div' 1000000000 


return (mes-H-" 


: time="+f (show time)) 



The following benchmarks provide the code used in the section [9j 



bmlt = benchmark "ack 3 7 on t" (ack (t (tolnteger 3)) (t (tolnteger 7))) 
bmlb = benchmark "ack 3 7 on b" (ack (b (tolnteger 3)) (b (tolnteger 7))) 
bmln = benchmark "ack 3 7 on n" (ack (n (tolnteger 3)) (n (tolnteger 7))) 

bm2t = benchmark "exp2 t" (exp2 (exp2 (t (tolnteger 14)))) 
bm2b = benchmark "exp2 b" (exp2 (exp2 (b (tolnteger 14)))) 
bm2n = benchmark "exp2 n" (exp2 (exp2 (n (tolnteger 14)))) 

bm3 tvar = benchmark "sparse_set on a type" (n (bitsize (from_set ps))) 
where ps = map tvar [101 , 2002 .. 100000] 

bm4t benchmark "bitsize of Mersenne 45" (n (bitsize mersenne45)) 
bm5t = benchmark "bitsize of Perfect 45" (n (bitsize perfect45)) 

bm6t = benchmark "large leftshift" (leftshift n n) where 



n = t prime45 



bm3' tvar m = benchmark "to/from list on a type" 

(n (bitsize (from_list (to_list (from_list ps))))) 
where ps = map tvar [101 , 2002 .. 3000+m] 

bm3'' tvar m = benchmark "to/from list on a type" 

(n (bitsize (from_list (to_list (from_list ps))))) 
where ps = map (dual. tvar) [101 ,2002 .. 3000-hn] 



bm7t = benchmark "primes on t" 

(last (take 100 ps)) where ps = primes :: [T] 
bm7b = benchmark "primes on b" 

(last (take 100 ps)) where ps = primes :: [B] 
bm7n = benchmark "primes on n" 

(last (take 100 ps)) where ps = primes :: [Integer] 



bm8t = benchmark "mersenne on t" 

(last (take 7 ps)) where ps = mersenne_primes : : [T] 
bm8b = benchmark "mersenne on b" 

(last (take 7 ps)) where ps = mersenne_primes :: [B] 
bm8n = benchmark "mersenne on n" 

(last (take 7 ps)) where ps = mersenne_primes :: [Integer] 

The following tests the Syracuse / Collatz conjecture up to m 
test_syr tvar m = maximum (map length (map (nsyr . tvar) [0..m])) 

compress_syr tvar m = r where 
nss = map (nsyr . tvar) [0. .m] 

r = maximum (map (n. bitsize) (map from_list nss)) 

— overflows for m>2 except for tvar^t 
compress_syr_twice tvar m = r where 
nss = map (nsyr . tvar) [0. .m] 

r— (n. bitsize) (from_list (map from_list nss)) 

bm9 tvar = benchmark "test Syracuse" (test_syr tvar 2000) 

bmlO tvar = benchmark "compress Syracuse" (compress_syr tvar 100) 

bmll tvar = benchmark "compress syracuse_twice" r where 
r = compress_syr_twice tvar 20 

The following function computes the size of a tree-represented natural num- 
ber: 

tsize T = 1 

tsize (V x xs) = 1+ sum (map tsize (x:xs)) 
tsize (W x xs) = 1+ sum (map tsize (x:xs)) 

The function kth computes the k-th iteration of a function application. 



The following assertions are used for testing some of our operations: 

— relation between iterations of o,i and power of 2 

al k = pow (i e) k = s (kth o k e) 

a2 k = pow (i e) k = s (s (kth i (s' k) e)) 



— relations between power operations and multiplication 
a3 n b = (u=v,u,v) where 

m = pow (i e) n 

u = kth o n b 

v = s' (mul m (s b) ) 



a4 x y = (a=b,a,b) where 
a = mul (pow (i e) x) y 
b = s (kth ox (s> y)) 



